FIGURE 1 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT
CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT
ATG 



FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pI: 8.39, NX(S/T): 0

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features:

Type II transmembrane domain:

amino acids 13-33

Other transmembrane domains:

amino acids 54-73, 94-113, 160-180, 122-141

N-myristoylation sites:

amino acids 57-63, 95-101, 99-105, 124-130, 183-189
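The "NX(S/T)" entry in the header above counts potential N-glycosylation sequons. A minimal Python sketch of such a scan follows. The function names are hypothetical, and excluding proline at the X position is an assumption taken from the usual consensus pattern rather than something stated in the figure.

```python
import re

# Assumed consensus: Asn, then any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping sequons (e.g. "NNSS") each be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def clean(seq: str) -> str:
    """Remove layout whitespace and upper-case the one-letter residues."""
    return "".join(seq.split()).upper()


def sequon_positions(seq: str) -> list[int]:
    """Return the 1-based position of the Asn in every N-X-(S/T) motif."""
    return [m.start() + 1 for m in SEQUON.finditer(clean(seq))]


if __name__ == "__main__":
    # Toy sequence, not taken from the figures.
    print(sequon_positions("MNGT ANPS NNSS"))
```

Run on the 266-residue sequence of FIGURE 2, a scan of this kind would be expected to find no sequon, in line with the "NX(S/T): 0" entry.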
FIGURE 3

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT
ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG
CCTT GAGAT T GACT CAT TAAAAT C AGAGACT GT AAC AAAAAAAAAAAAAAAAAAAAAGGG C GGC C GC G 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
FIGURE 4

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYVVQVIFSVTFAFSCTMFELIIFEILGV
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
FIGURE 5



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT
TCCATCTGGACCACGAGGCTGCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATTATCACCACCAAAGATCCCATATTCAACACTCAAACTGCAACACAAACAACAGAATTTATTGT
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG
GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA
TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGC
GATGCCTGGAAGCTGAAGTTTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTTTCTTT
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC
ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA
AAAATAAAAACTCTTATAAATTTCTATTTTTATCTGAGTTACAGAAATGATTACTAAGGAAGATT
ACTCAGTAATTTGTTTAAAAAGTAATAAAATTCAACAAACATTTGCTGAATAGCTACTATATGTC
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT
TATTATTAACATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCC
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT
TGTGACAAAAAATTAAAGCATTTAGAAAACTT
FIGURE 6



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG
LSLAGKDQVETALKASFETCSYGWVGDGFVVISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV

Signal sequence:

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
FIGURE 7

CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA
AAGGATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC
ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
TGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA
TACAGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGC
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA
CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCATTTAGAA
ATCAAGCATAAATCACT TCAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 8

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNSMFREVEELMEDTQHKL

RSAVEEMEAEEAAAKASSEVNLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE

TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 

TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 

ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 

RSLTEEMALGEPAAAAAALLGGEEI 



Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208



Casein kinase II phosphorylation site.

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316,
327-331

N-myristoylation site.

amino acids 202-208, 217-223



Amidation site. 

amino acids 140-144 
FIGURE 9

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATTTTATTAAAGATATTTTGTTAACTC



FIGURE 10 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL
NWVLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN
IVRVVVLDKVTDLLLFFGKLLVVGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK

Important features:
Transmembrane domains:

amino acids 57-80 (type II), 110-126, 215-231, 254-274

N-glycosylation sites.

amino acids 16-20, 27-31, 289-293

Hypothetical YBR002c family proteins.

amino acids 276-288

Ammonium transporters proteins.

amino acids 204-231



N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
FIGURE 11



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 

TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC

CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 

TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 

GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 

CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 

GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 

ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 

CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 

GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 

TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 

AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 

ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 

CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 

CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 

ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 

AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 

TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 

GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
FIGURE 12



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 

YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGVVGSFLFILIQLVLLIDFAHSWNQRWLGKAE

ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV

QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETVVAGPEGYETQWWDAPSI

VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY

SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWVKICASWAGLLLYLWTLVAPLLLRNRDFS

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
FIGURE 13

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTATTCTTCATAT
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT
GAGAAACCACTTTTAGAACTATGAGTACTACTTTTGTTAAATGTGAAAAACCCTCACAGAAAGTC
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC
CACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACCTCACAAACGTTGTAC
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGACTTAATATATTGAAGTAACACTTTTTTAGTAAGCAAGATACCTTTTTATTTCAATTCAC
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA
CTAAATTAAAAA



FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIALTT
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 



Important features of the protein:
Signal peptide:

amino acids 1-20

Transmembrane domains:

amino acids 54-72, 100-118, 130-144, 146-166

N-myristoylation sites.

amino acids 14-20, 78-84, 79-85, 202-208, 217-223
FIGURE 15



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTAGGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCGCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC
TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT
GGGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATGAA
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA
FIGURE 16



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 

MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY

LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 

AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTIASPEE 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPVVREPTALSSSLAPTWLSPTAPATEAP

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE

CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature.

amino acids 430-442

Leucine zipper pattern.

amino acids 197-219, 269-291

N-glycosylation sites.

amino acids 101-105, 117-121, 273-277, 500-504, 528-532

Tyrosine kinase phosphorylation sites.

amino acids 124-131, 337-345

N-myristoylation sites.

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354,
594-600, 640-646
FIGURE 17






GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTA
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG
AAGATATCAGCTTTCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT
AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT
AACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACAATTTTTCTTTAAAATGATTAG
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG
GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA
AAATTTTGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATTTTGCAACAATGC
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACT CAT T T T TAAT AAAAT TAT GT CT AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 18



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRVVAGQIFLDSEESEL
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
FIGURE 19



AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC
ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAA 



FIGURE 20

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD

LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 






Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites.

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
FIGURE 21

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGT TGAAGT TGTT TATACAATTGA 
CATTCAGAAATATAT TCCATGC TAT CAGC TT T T TAGCT T T T ATAAT T CT T CAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAAT GTCAAAAAGAATGTGGTAGGTT GGTACAAAT TCCGT 
CGTCAT T CAGATCAGAT CATGACG TT TAGAGAGAGGC TGCT T CACAAAAACT TGCAGGAGCAT TT 
T TCAAACCAAGACC TTGTTTTTCTGC TAT TAACACCAAGTATAATAACAGAAAGCT GC TCT ACTC 
ATCGACT GGAACAT TCC T T ATATAAACCT CAAAAAGGAC T TT T TCACAGGGT ACC T T TAGT GGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T TTTAGCCGAGCAGTACAAACACACAGCT CTAAAT TTTT TGAAGAAGATGGATCCTTAAAGGAGG 
TACATAAGATAAATGAAATGTATGCT TCAT TACAAGAGGAATTAAAGAGTATATGCAAAAAAGTG 
GAAGACAGT GAACAAGCAGTAGATAAACTAGTAAAGGAT GT AAACAGAT TAAAACGAGAAAT T GA 
GAAAAGGAGAGGAGCACAGAT T CAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC TCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGT C T T TAAAAAATAGACATGT T T C TAAAAGTAGC T GT AACTACAACCACCATC TCGAT GT AGT 
AGACAAT CT GACCT TAAT GGTAGAACACACT GACAT T CCTGAAGCTAGT CCAGC TAGTACACCAC 
AAATCAT TAAGCATAAAGCC TTAGAC TTAGAT GACAGAT GGCAATT CAAGAGAT CTCGGTT GT TA 
GATACACAAGACAAACGAT CTAAAGCAAAT AC TGGTAGTAG TAACCAAGATAAAGCAT CCAAAAT 
GAGCAGCCCAGAAACAGAT GAAGAAAT TGAAAAGATGAAGGGTT T TGGTGAATATTCACGGTCTC 
C TACATTT TGA T CC T T T TAACC T TACAAGGAGAT T T T T T TATT T GGCT GAT GGGTAAAGCCAAAC 
AT T T CT AT T GT TT T T ACTAT GT TGAGC TAC TTGCAGT AAGT TCAT TT GT T T T TAC TAT GT TCACC 
TGTT T GCAGTAATACACAGATAAC TCT TAG T GCAT T TACT T CACAAAGT ACT TT T TCAAACAT CA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCAT GT CTATTAAAAAATAAAAT GGAAAAGCAAGAATAGCC TTAT T T T CAAAATAT GGAAA 
GAAATT TATATGAAAAT T TATC T GAGT CAT T AAAAT T CTCC TTAAGTGATAC T T T T T TAGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAAT TT GCAAAACATCATCT 
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 22

MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEVVYTIDIQKYI
PCYQLFSFYNSSGEVNEQALKKILSNVKKNVVGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLVVANLGMSEQLGYKTVSGSCMSTGFSRAV
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDVVDNLTL
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features:

Signal peptide: 
amino acids 1-19 

N-glycosylation sites.

amino acids 75-79, 322-326

N-myristoylation site.

amino acids 18 4-154

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCAC AGC CGCGC GGCGGAGGGCAGAGT CAGCCGAGCC GAGT CCAGCCGGACGAGCGGAC CAGCGGAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACC ATG GCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGTAAAGATGATTTCAAAAGT GT GGT CAGC GAACAGT GCAAT CAT T T G 
CAAGCT GT CTT T GCTT CACGT TACAAGAAGTTT GAT GAAT T CT T CAAAGAACTACT TGAAAAT GCAGAGAAAT CC CTG 
AAT GATAT GTTT GTGAAGACAT ATGGC CAT TTAT ACAT GCAAAATT CT GAGCTAT TTAAAGAT CT CTTC GTAGAGTT G 
AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
T T C CGCCT GGT GAACT CCCAGTACCACT TTACAGAT GAGTATCT G GAAT GTGT GAGCAAGTAT ACGGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
T TAGCG GT T G CGGGAGAT GT CGT GAGCAAGGTCTCC GT GGTAAAC C CCACAGC CCAGT GTACCCATGCC CT GTT GAAG 
AT GAT CTACT GCTC CCACT GCC GGGGT CTC GT GACTGT GAAGCCAT GTTACAACTACT GCT CAAACAT CAT GAGAGGC 
TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
GAGGGT CCTT T CAACATT GAATCGGT CAT GGAT CCCAT C GATGT GAAGATTTCT GATGCTATT AT GAACAT GCAGGAT 
AATAGT GTT CAAGT GT CT CAGAAGGT TTT CCAGGGATGT G GAC CCC CCAAGCCCCT CC CAGCT GGACGAATTT CT CGT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATC^CCCCGAGGAACGCCC^CCACAGCAGCTGGCACT 
AGTTT GGACC GACTGGTT ACT GAT GT CAAGGAGAAACT GAAACAGGCCAAGAAATT CT GGTCCTCCCTT C CGAGCAAC 
GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CT GTT T GCAGT GACAG GAAAT GGATTAGCCAACCAGGGCAACAACC CAGAGGT CCAGGT T GACACCAGCAAACCAGAC 
AT ACT GAT CCTTCGT CAAAT CAT GGCTCT T C GAGT GAT GAC CAGCAAGAT GAAGAATG CATACAAT GGGAACGAC GTG 
GACTT CTTT GATAT CAGT GAT GAAAGTAGT GGAGAAGGAAGTGGAAGT GGCTGTGAGTAT CAGCAGT GCC CTT CAGAG 
T TT GACTACAATGCCACT GACCAT GCTGGGAAGAGT GCGAATGAGAAAGC C GACAGTGCTGGTGT C CGTC C TGGGGCA 
CAG GC CTACCT CCT CACT GT CTT CT GCAT CTTGTT C CT GGTTATGCAGAGAGAGT GGAGATAATT CTCAAACT CT GAG 
AAAAAGTGT T CAT CAAAAAGTTAAAAGGCAC CAGTTAT CACTTT T CTACCATCCTAGT GACTTT GCTT TTTAAAT GAA 
TGGACAACAATGTAC^GTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACTGT GCATT GAGT TGGTTCCTGCTCCCC CAAACCATGTTAAAC GT GGCTAACAGT GTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTT TTTCCAACT GT GAT CT CGC CTT GT TT CTT ACAAGCAAACCAGGGT CC CTT CTTGGCAC GT AACAT GTACGTATT 
T CT GAAATATTAAATAGCT GTACAGAAGCAGGT TT TAT TTAT CAT GTT AT CTTATTAAAAGAAAAAGCC CAAAAAGC 



App_ID=10063617






FIGURE 24 



MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST
CCSQEMEEKYSLQSKDDFKSVVSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH
LYMQNSELFKDLFVELKRYYVVGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDVVSKVSVVNPTAQCTHALLKMIYCSHCRGL
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG
KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR



Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop)

amino acids 515-524

N-glycosylation site.

amino acids 514-518



Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C TGAGTCATCCCCAGGGAT CAGGAGCC T CCAGCAGGGAACC TT CCAT TATATTCT TCAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGC TT C TAGGAGAT GGCT CCAGGAAGGCGGC CAAGAAT GTGAGTGCAAAGAT TGGT T CC TGAG 
AGCCCCGAGAAGAAAATT CATGACAGT GT C TGGGCTGCCAAAGAAGCAGT GC CC CTGT GAT CAT T 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTG TAG GAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGATUVCTGTAGCTTCCT 
AGC TAGT GT CAT T TAACC T TAAATGCAAT CAGGAAAGTAGCAAACAGAAGT CAATAAATAT TT TT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features:
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GC AGAAT GAGAGAAGAC AT TC ATG T ACCACCTAC T AGT C CC TC TC T CCCCAACC T C T GCC AGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CC TC T CT C C CCACTAC CACC T T CT TCCT GAGC T GGGGGCACCAGGGAGAATC AGAGATGCT GGGG 
ATGCCAGAGCAAGACTCAAAGAGGCAGAGGTTT TGTTCTCAAATATTT TT TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFVVAPGEDM
RQLADGSMDVVVCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites.

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 29



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGT TTATGATTCT TATACTAATT TATACAAAGATATTAAGGCCCTGTTCATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 



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FIGURE 30






MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTT TAT GG T GGC TG T AT TAT GAC T AT ACC AAC GAC C T CAG CATAGAAT T G GAC A 
CAGAAAGG GAAAAT AT GAAG TGCGTGCTGGGGTT T GCT AT C G TAT C C AC AGG CATCACGG CAG T G 
C TGC TCGT CT T GAT TT TTG T TC T CAGAAAGAGAATAAAAT TGACAGT T GAGCTT T TCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATT GGCCT CAT C TGGACT AGT GAAT T CAT CCT TGCGTGCCAGCAAATGAC TATAGCT GGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT T CCGAGAATCAT TGTCATGTACAT GCAAAACGCACTGAAAGAACAGCAGCAT GGT GCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AAC CAGAAT GCAT ATACTACAACTGC TAT TAATGGGACAGAT TTCTGTACAT CAGCAAAAGATGC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCC CTAC TT TAT GGATCAAGAAT T TCT GAGT T TCGTA 
AAAAGGAGCAACAAAT T AAACAAT GCAAGGGCACAGCAGGACAAGCAC T CAT TAAGGAAT GAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCTGGAAAACATT 
TCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAATTTTTTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA
GAVVTCYFNRSKNDPPDHPILSSLSILFFYHQGTVVKGSFLISVVRIPRIIVMYMQNALKEQQHG
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD
FIIFLGKVLVVCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAGT TATT CCAACGGT GT CCAATCGT T AT CCCTACCACGATGGAGAGAAT CCT T 
TATCGTC T T AGAAAGTAAACCCAAAAAGGGTGTAAC CTACCCATCAGCTC T T ACATAC T CAT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG
GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC
AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC
GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC
TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC
CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC
ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAAT
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA
GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC
GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC
TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGT TAAT GAT CC TCAT CACCGACGGGAGGT CC TACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GTT TGACAACC T CCAT CAG TATGTCCCCAGGATCATCCAGAACATT TGTACAGAG TT CAACT CAC 
AGCCT CGGAAC TGAAT T CAGAGCAGGCAGAGCACCAGCAAGT GC TGC T TTAC TAACTGACGT GTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACAT 
T T T GACAAT T GTT T TCAAAATAAATGTTCGGAATACAGTGCAGC CCT TACGACAGGC T TAC GTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T TT TGTCAT GACAAT GTAGGAATTGC TGAATTAAATGT T T AGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTVVLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI
GPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN
RSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNG
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL
FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF
VDEFDNLHQYVPRIIQNICTEFNSQPRN

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain:

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCGAGC AC AGGAGATT GC CTGC GT TT AGGAGGTGGC TGCG TT GT GGGAAAAGCT AT CAAGGAAGAAAT TGC 
CAAACC ATGT CT T TTTTT CTGT TT TCAG AGT AGT T C ACAACAGATCT GAGTGTT TT AATT AAGCAT GG AAT 
ACAGAAAACAAC AAAAAACTT AAGCTTT AATT TC AT CT GGAAT T CCAC AGTTTT CT TAGCTC CCT GGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATG GCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGAT GT C ACT GAGAT C CC TC AAAT GG AGCCTCCT GC TGCT GTCACT CC TGAGT T TC TTT GTGAT GT GGT AC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT
TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC
ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA
ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA
GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT
TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG
GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT
ACACTGAACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTCAGG
CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 
ACCAAACAAT TT GGAC AT GTCATT CT GT AGACT AGAAT T T CT T AAAAGGG TGTT ACT G AGTT AT AAGCTC A 
CT AGGCTGT AAAAACAAAACAATGT AGAGT TT T AT T TAT T GAAC AAT GT AGT CACT T GAAGGT T TT GT GT A 
T ATC TT ATGT GGATT ACC AATT TAAAAAT AT ATGT AGT T CTGTGTC AAAAAACT TCT TCACT GAAGTT AT A 
CTGAAC AAAATT T T AC CT GTTT TT GGTC AT TT AT AAAGT ACT TC AAGATGT TGC AGT ATTTC ACAGTT ATT 
ATTATT TAAAAT T ACTTC AACT TT GTGT TT TT AAAT GT TT TGACGAT T TCAAT ACAAGAT AAAAAGGAT AG 
T GAATC ATT C TT TAC ATGC AAACATTT T CCAGT T AC TT AACT GATC AGTT T ATT AT TGAT AC ATCACT CCA 
TTAATGT AAAGT C ATAGGTCAT TATTGC AT ATC AGT AATCT CTT GGAC TT TGT T AAATAT TT T ACT GT GGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR
NTTCHY

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites.

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT TGCCAAGAGAGTAC AC AGTC AT TAAT GAAGCCT GCCC TGGAGCAGAGTGGAATATCAT GTGTCGGGAGTGCTGTG 
AATATGATCAGATTGAGTGCGTCTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGT TC TGCGAGCCCC AAAGGGTCAGAT TTTGT T GGAAAGC TATCCCC T AAAT GC TCACTGTGAATGGACCATTCATGC 
TAAACCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 
GTTCGTGAT GGAGACAACCGC GATGGCCAGATCAT C AAGCGTGTC TGTGGCAACGAGCGGCCAGC TCC TAT C C AGAGCATAG 
GATCC TCAC TCCACGTC CTC T TCC ACTCCGATGGC TCCAAGAATT TT GACGGTTTC C ATGC CATT TATGAGGAGATCACAGC 
ATGC TCC TCATC CCC T TGTTT CCATGACGGC ACGT GC GTCC TTGACAAGGC T GGATC TTACAAGTGTGCCTGC TT GGCAGGC 
TAT AC TGGGCAGCGC T GTGAAAATCTCC T TGAAGAAAGAAAC TGC TC AGAC CC TGGGGGCCCAGTCAATGGGTACCAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
T C T TAGTGGCAATGAGAAAAGAAC TTGCCAGC AGAATGGAGAGT GGTCAGGGAAAC AGC CCATC TGCATAAAAGCCTGCC GA 
GAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGTTCAGTCAAGGGAGACACCATTACACCAGCTAT 
AC T CAGC GGCCT TCAGCAAGCAGAAACTGCAGAGT GCCCCTACCAAGAAGC CAGC C C T TCCCT T TGGAGATC TGCCCATGGG 
ATACC AAC ATCT GCATACCC AGC TCCAGTAT GAGTGCATC T CACCCT TC TAC CGC CGCC TGGGCAGCAGCAGGAGGACATGT 
CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCAAAGA
CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG
GTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATGAGAAGACCATCC
AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG
TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG
TGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT
C TGTGCCAGCT GGGAACCCAC TGCCCC T TCT GATAT CTGC ACTGCAGAGACAGGAGGCATCGCGGCTGT GTCC TT CC CGGGA 
CGAGCATC TCC T GAGCCACGCTGGC ATC TGATGGGACTGG TCAGC TGGAGCTATGATAAAACATGCAGCCACAGGC T CTCCA 
C TGCC TTC ACCAAGGTGC TGCCTT T TAAAGAC TGGATTGAAAGAAAT ATGAAATGAACCATGC TCATGCAC TC CT T GAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTT C TGACTTCAGGGACAAAAC TC AGT GAAGGGTGAGT AGACC T CCAT TGC T GGTAGGC TGATGCC GC GTCCA 
C TAC T AGGACAGCCAAT TGGAAGAT GCCAGGGCTT GCAAGAAGTAAGT T TCT TCAAAGAAGACCATAT ACAAAAC CTC TCC A 
CTCCAC TGACC TGGTGGTC T TCCCC AACT TTC AGT TATACGAATGCCATCAGC TTGACCAGGGAAGATC TGGGCT TC ATGAG 
GCC CC TTT TGAGGCT C TCAAGTTC TAGAGAGC TGCC TGTGGGACAGCCCAGGGCAGC AGAGCT GGGATGTGGT GCAT GCCT T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 






MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREVVGYT
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTVVSFFCNNSYVLSGNE
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVVVAAHCVTDLGKVTMIKTADL
KVVLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGVVSVVDSLLCEEQHEDHGIPVSVTDNMFCA
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI
ERNMK



Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 

amino acids 260-272 



N-glycosylation sites.

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 
474-480, 491-497, 638-644, 666-672 
Amidation site.
amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 39



GGT TC CTACAT C CT CT CATCT GAGAAT CAGAGAGCATAAT CTTCTTACGGGCCC GTGATT T ATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC
T GGT CT T GCCT TGGCT CAGT CCT GCTAACT ACATT GACAAT GT GGGCAAC CTG CACTT CCT GTATT CAGAACT CT GTA 
AAGGTGCCTCCC^CTACGGCCTGACOW^GATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T CACAGCCACG GCTCCCT CCCCAGAGGT TT CT GCAGCT GCCACCAT CT C CTTAAT GACAGACGAGC CT GG C CTAGACA 
ACC CT GCCTAC GT GT CCT CGGCAGAG GACG GGCAGC CAGCAAT CAGCC CAGT GGACTCT GGCCGGAGCAAC C GAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAAGAGCGG GAGTGCAGT T GC CAAC CAT GC CGACCAGGGCAGGGAAAATT CT GAAAACACCACT G C CCCT GAAG 
T CTTT C CAAGGTT GT ACCAC CTGATT CCAGAT GGT GAAATTACCAGCAT C AAGAT CAAT C GAGTAGAT C C CAGT GAAA 
GCCTCT CTAT TAGGCTGGT G GGAGGTAGC GAAACCC CACT GGT CCATATCATTAT CCAACACATTTAT C GT GAT GGGG 
TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA
GCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCGAGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCC
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC
GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACA
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG
GCAT GACCGT CGCAGGGGGAG CATCACATAGAGAAT GGGATTT GC CTAT CTAT GT CAT CAGT GTTGAGC C CGGAGGAG 
TCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACAGAGGTCAGCC 
GGAGT GAGGCAGT GGCAT TATTGAAAAGAACATCAT CCT CGATAGTACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 
C CCAGGAAGACT G CAGCAGC C CAGCAGCC CTGGACT CCAAC CACAACAT GGCCCCACCCAGTGACT GGT CCC CAT CCT 
GGGT CAT GT GGCT GGAAT TACCAC GGT GCT TGTATAACTGTAAAGATAT T GTAT T AC GAAGAAAC AC AG CTG GAAGT C 
TGGGCTTCT GCATT GTAGGAGGTTAT GAAGAATACAAT GGAAACAAAC CTTTT TTCAT CAAAT CCATTGT TGAAGGAA 
CAC CAGCATACAAT GAT GGAAGAATTAGAT GTGGT GATATT CTTCTTGCTGT CAAT GGTAGAAGTACAT CAGGAATGA 
T ACAT G CTT GCTT GGCAAGACTGCT GAAAGAACT TAAAGGAAGAATTACTCTAACTATT GTTTCT T G GCCT GGCACTT 
TTTTATAGAATCAAT GAT GGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTATAT TTATC 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACACCT CAGAAAATAT GAT T CCAAAAAAATTAAAACTACTAGTT TTTT TT CAGT GT GGAGGATT T CT CATTACTCTAC 
AACATT GTTTATATTT TT TCTAT T CAATAAAAAGCC CTAAAACAACTAAAATGATT GATTT GTATAC CCCACT GAATT 
CAAGCT GATTTAAATT TAAAATTT GGTATAT GCT GAAGTCT GC CAAGGGTACATTATG GCCAT T TTTAATTTACAGCT 
AAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAAACAAG 
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FIGURE 40

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLVVSRQVRQRSPDIFQEAGWNSNGSWSPG
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGT GAAGCAGAGAGATAGAT ATT ATTCACGTAAT AAAAAAC ATG GGCT TCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCGATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGAC C CT CAT TTT GGG GAAG GGAAAAACT C T GACT AAT GAAG CAT C CACGAAGAAGGT AGAACT T 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG
CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG
GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT
TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT
CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAAC G CTAT T T GAG GAC T CT GGT T GAAGGAGAT T TAT T TAAATT T GAAGTAATATAT T AT GGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGG TACGAAGAT ACAAT ACT G T TAT T CAT TT AT C C T GTAC AAT CAT C TGT GAAGT GG TG G T GT C AGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACAC^ 

GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TTTAAAGAGTTT TTGTAAAATGATTTTGTACAAGTAGGATAT GAAT TAGCAGTT TACAAGTTTACAT 
ATTAACTAATAATAAATAT GT CTAT CAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTVVGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV
DLVPENDFNLYKCEEHPKHLVVGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV
SVEHNPLYINITVDFWFGA

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43






GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 44 



MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQIAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 



amino acids 58-59



N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
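
The amino acid sequence of Figure 44 appears to be the conceptual translation of the open reading frame that begins at the first ATG of Figure 43. The following is a minimal sketch of that translation step, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical and are not taken from this document.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here;
# the document does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, start=0):
    """Translate DNA from offset `start` until the first stop codon.

    Whitespace (for example OCR spacing) is ignored; any codon that is
    not made of A, C, G and T is reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(start, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


# First eight codons of the Figure 43 reading frame.
print(translate("ATG GCA CTG AGC TCC CAG ATC TGG"))
```

Run on the first eight codons of the Figure 43 reading frame, the sketch yields MALSSQIW, which matches the start of the Figure 44 sequence.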
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATG GCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC TT CAACACAACCC CTCT TGTCACCATACAGCCAGAAGGGGGCACTAT CATAGT GACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTTTCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGG
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
FIGURE 46

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
FIGURE 47



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG
TGTAGTTTAATGGGAATAAAATGTAAGTATCAGTAGTTTGAAAAAAAAAAA
FIGURE 48

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
FIGURE 49



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
FIGURE 50

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS
SQKQHSPVPEKAIPLITPGSATTC



Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain:

amino acids 36-59

N-myristoylation sites.

amino acids 41-47, 45-51, 84-90

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7.

amino acids 54-67
FIGURE 51

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGGCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 52

MKFQGPLACLLIALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP

Signal peptide:

amino acids 1-21

N-glycosylation site.

amino acids 265-269

Glycosaminoglycan attachment site.

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392

Casein kinase II phosphorylation site.

amino acids 26-30, 109-113, 259-263, 300-304, 304-308

N-myristoylation site.

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80,
90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161,
159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224,
236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252,
253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285,
283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301,
298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329,
325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389,
387-393, 389-395, 395-401

Cell attachment sequence. 

amino acids 301-304 



App_ID=10063617






FIGURE 53






GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 



FIGURE 54

MSLLSLPWLGLRPVAMSPWLLLLLWGSW 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSMTIMLDKWQHLASEGSSRLDMFEHI 

SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 

LAFIPFSAGPRNCIGQAFAMAEMKWLALMLLHFRFLPDHTEPRRKLELIMRAEGGLW

VGLQ 

Important features:
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
FIGURE 55

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA
AAAAAAA?VAAAAAAAAAAAAAAAAZ\AAAAAAAAAAAAA 
FIGURE 56



MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGIALIFCILQSLALTWYSLSFIPFAR
DAVKKCFAVCLA

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72

FIGURE 57



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGT GCCGGT CAT C GGGGAAGC CCAGACC GAGTT CCAGTACTTT GAGTC GAAGGGGCT C C CT GC CGAGCT GAAGTCC 
AT TTT CAAGCT CAGT GT CTT CAT CCCCTCC CAGGAATT CT C CACC TAC CGC CAGT GGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGAC CTT GAT GGGCAGCT AGAC T TT GAAGAATTT GTC CATTAT CT CCAAGAT CAT GAGAAGAAGC TGAGG 
CT GGT GTTT AAGATTT T G GACAAAAAGAAT GATGGAC GCATT GAC GCGCAGGAGAT CATG CAGTC CCT GC GGGACTT G 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAATT C T CAAGAGCATGGATAAAAACGGCACGAT GACCAT CGACTGG 
AAC GAGT GGAGAGACTAC CACCT CCT C CACC CCGT GGAAAACATC CCCGAGAT CATCCTCTACTGGAAGCATT CCACG 
AT CTTT GATGT GG GT GAGAAT CTAACGGTCC CGGAT GAGTT CACAGTGGAGGAGAG GCAGACGGGGAT GTGGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATC^ 

AT CAAGCG CCTTGTT G GT AGT GACCAGGAGACT CT GAGGATTCACGAGAG GCTT GT GG CAGGGTCCTT GGCAGGG GCC 
AT CGCCCAGAGCAGC ATCTACCCAAT GGAGGTCCT GAAGACCC GGATGGC GCTGC GGAAGACAGGC CAGTACTCAGGA 
ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
GGCAT CAT CCC CT ATGC CGGCAT CGACCTT GCAGT CT AC GAGACGCT CAAGAAT GCCT GGCT GCAGCACTAT GCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC
ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG
ACTGTTGGGAAAAGGGTTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC
GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT
AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTT

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTT CT GCT G CCCTT GCTTAACAAT GC CGG CCAAC T GGCGAC CT CACGGTT GCACTT CCATT CCACCAGAAT GAC CTGA 
T GAGGAAAT CTTCAAT AGGAT GCAAAGAT CAATGCAAAAATTGTTATATATGAAC ATAT AACTGGAGT CGT CAAAAAG 
CAAATTAAGAAAGAAT TGGACGT TAGAAGT T GT CATTTAAAGCAGCCTT CTAATAAAGT T GTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 58

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG
TMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMWWRHLVAGGG
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR
ILAREGVAAFYKGYVPNMLGIIPYAGIDLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC
GQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISYV
VYENLKITLGVQSR

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
FIGURE 59

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA
TCAACAACACATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTG
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT
GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC
CACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT
CTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGA
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGT 
CGGGGACACCGAT T T TATAAATAAAC T GAGCACC T TC T T T T T AAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 60

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVS
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL
CVSSFFAISWALLPLSPYLMLK

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
FIGURE 61



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAA^T 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 



FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMM^ 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLSFEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61-66,
70-75, 78-83, 83-88, 87-92, 110-115
FIGURE 63



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCT CATC TCCAATAAAATAAAAGCAC T TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAA?VAAAAAAAAA^
FIGURE 64

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGE^ 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



Important features of the protein:
Signal peptide:

amino acids 1-26



Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
FIGURE 65

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
FIGURE 66

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT

Important features: 
Signal peptide: 

amino acids 1-19 



N-glycosylation site.

amino acids 37-41

N-myristoylation sites.

amino acids 15-21, 19-25, 60-66
FIGURE 67

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG
TAGTGAGATGTAAAAAAAAAAAAAAAAAA
FIGURE 68



MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS
LLAALLIAVGLALLVRKLREKR^

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
FIGURE 69



G CCAGGAATAACTAGAGAGGAACA ATGG GGTTATT CAGAGGTT TT GTT TT C CT CTTAGTT CTGT GC CT G CT GCACCAG 
T CAAATACTT CCT TCATTAAGCT GAATAATAAT GGCTTT GAAGAT AT T GT CATT GT TATAGAT CCTAGT GT GCCAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGATAT GGT GAC TACAGCTT CTAC GTACCT GTTT GAAGCCACAGAAAAAAGA 
T TTTTTTT CAAAAAT GT ATCT ATATTAATT C CT GAGAATT GGAAGGAAAAT CCT CAGTACAAAAGGCCAAAACATGAA 
AACCATAAACATGCT GAT GTTATAGT TGCAC C ACCT ACACT CCCAGGTAGAGAT GAAC CATACAC CAAGCAGT T CACA 
GAAT GT GGAGAGAAAGGCGAATACAT TGACTT CACC CCT GACCTT CTACTT GGAAAAAAACAAAAT GAATATGGACCA 
CC^GGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
T ACCGT GCTAAGT CAAAAAAAAT C GAAGCAACAAGGTGTTCCGCAGGT AT CTCT GGTAGAAATAGAGTT TATAAGT GT 
CAAGGAGGCAGCT GT CTTAGTAGAGCAT GCAGAAT T GATT CTACAACAAAACT GTATGGAAAAGAT TGT CAAT T CTTT 
CCT GAT AAAGTACAAACAGAAAAAGCAT C CATAAT GTT TAT GCAAAGTATT GATTCTGTT GTT GAATTTT GTAACGAA 
AAAACCCATAATCAAGAAGCTCCAAGCCTACAAAACATAAAGTGCA 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA
AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA
AATAAGCTAATCCAAATAAAAAGCAGTGATGAAAGAAACACACTCATGGCAGGATTACCTACATATCCTCTGGGAGGA
ACTTCCATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA
CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT
TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG
TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA
ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT
TGGGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCT
GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATTGTTTACGCAGAA
ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT
TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT
ACAGAAAATGGCAGATATAGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCA

CT GAATAGAGCCGCGTACATACCAGGCT GGGTAGTGAACGGGGAAATT GAAGCAAACC CGCCAAGAC CT GAAATT GAT 
GAGGATACT CAGACCAC CT T GGAGGATTT CAG CCGAACAGCAT CCGGAGGT GCATT T GT GGTAT CACAAGTC CCAAG C 
CTTCCCTTGCCTGACCAATACCGACGAAGTCAAATCACAGACCTTGA^ 

ACAT GGACAGCAC CAGGAGATAATTT T GATGTT GGAAAAGT TCAACGTTATAT CATAAGAAT AAGT G CAAGTAT TCTT 
GAT CTAAGAGACAGTTTT GAT GAT GCT CT TCAAGTAAATACTACTGAT CT GT CACCAAAGGAG GCCAACT CCAAGGAA 
AGCTTT GCATT TAAACCAGAAAATAT CT CAGAAGAAAAT GCAAC CCACATATTTAT TGCCATTAAAAGTAT AGAT AAA 
AGCAATTTGACATCAAAAGTATCC^CATTGCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGAT 
C CTACACCTACT CCTACT CCTACT CCTACT CCT GATAAAAGT CATAATT CT G GAGTTAATATT T CTACGCT GGTATT G 
T CT GT GATT GGGT CTGT T GTAATT GTTAACTT TATTT TAAGTACCACCATT TGAA CCTTAAC GAAGAAAAAAAT CT T C 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAAT GTAAGTAAAGGAT AT T TCT GAAT CT T AAAATTCAT C CCAT 
GT GT GAT CATAAACTCATAAAAATAATTTTAAGATGT CGGAAAAGGATACTT T GAT TAAATAAAAACACTCAT GGATA 
T GTAAAAACTGT CAAGATTAAAATTTAATAGTTT CATT TATTT GTTAT TTTATTTGTAAGAAATAGT GATGAACAAAG 
AT C CTTTT T CATACTGAT ACCT GGTT GTATATTATTT GATG CAACAGTT TT CT GAAATGATATTTCAAATTGCATCAA 
GAAATTAAAAT CAT CTAT CTGAGT AGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACAT T TGGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 70

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR
AANSSVPPITVNAPCMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGWSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVWGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains:

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
FIGURE 71

CT CCTTAGGT GGAAAC CCTGGGAGT AGAGT ACTGACAGCAAAGACC GGGAAAGACCATAC GT CCCCGGGCAGGGGT GA 
CAACAGGT GT CAT CTTTTTGAT CTCGT GTGT GGCT GCCTT CCTATTT CAAGGAAAGAC GC CAAGGTAATT T T GACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACTT GCCTGAGG GTGGAC CAGAAGAAAGGAAAGGT CCCCT CTT G CTGTT GGCT GCACAT CAGGAA 
GGCTGT GAT G GGAATGAAGGT GAAAACTTGGAGATTTCACTTCAGT CATT GCTT CT GCCT GCAAGAT CAT C CTTTAAA 
AGTAGAGAAGCT G CT CT GTGT GGT GGTTAACTCCAAGAGG CAGAACTC GTT CTAGAAGGAAAT GGATGCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC
CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG
CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG
AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC
CCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTAT
GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA
GTGAAAAATGAAAAGCTCAACATGGCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTC
CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG
AAAGAAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTTCAGGAACTTTACCTTCATC
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA
CAGCAGCTGGTCATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGAC
TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG
CTGACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGGGCATGCTG
GTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGA
GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA
GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA
AATATTCATGATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT
AGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGAT
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC
ATTTTTATATTTTTTAAGAAGATACTTTGAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATAT
CCAAGGACATGCCAAATGCTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACT
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC
ACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAGAAACCACATTTT
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACGAAAGTATACCGTGTGAACCAAACAATCTCTTTTC
AAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATATATATATATATATTGT
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACTGAATTATTTTTTAAATTAAGCAGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTT
CAAACTATTTTTTAAAAATAAATACAGTTAACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAGCACCAG
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG
CCATGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA

FIGURE 72

MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR
KYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL
RKQKQKTSSKKT

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 

FIGURE 73

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG
AATATAGCTTATGAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 

FIGURE 74



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS
VVFTSAQATTEA

Important features of the protein: 
Signal peptide: 
amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins

amino acids 197-245, 104-140, 22-69

FIGURE 75

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 

TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 

ATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTTCATCACAA 

ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 

AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 

AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 

ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 

AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 

CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 

TCTGTGATGGGCTATTGCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 

GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA

ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA

GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG

AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 

ACAGTCATTGAATATGAATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA

GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG

TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 

CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 

TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 

AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 

CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA

TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 

TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 

CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 

GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 

AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 

FIGURE 76

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISVVLTAPEKWKRNPEDLPVSMQQIY
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 

FIGURE 77

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 

FIGURE 78

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLVNTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG
FEAAESSLTKDALVLTPASLWKPSSPVSQ

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 

FIGURE 79

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGC
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA
GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA
GACTAATTTGTGCATGAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA

FIGURE 80



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
IYSTLLGLPADIQAAQAMMVTSSAISSLACIISVVGMRCTVFCQESRAKDRVAVAGGVFFILGGL
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 

FIGURE 81



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC

CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 

GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 

GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 

ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 

CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 

CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 

ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 

AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCGTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 

CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 

CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 

CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG

GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGAGTCAGATCAAGAAAGTCAG

GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT

GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG

ACATAACAAAGACCTAACAGTTGGAGATATGAGCTGTATAATTGTTGTTATTATATATTAATAAA

TAAGAAGT T GCAT TACCC TCAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 

FIGURE 83



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT
ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG
CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA



FIGURE 84



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSVVASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM

Signal peptide:

amino acids 1-17

Transmembrane domain:

amino acids 239-255

FIGURE 85

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGAC ATG CTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCGATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA
GAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAATTATG
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 

FIGURE 86



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGVVGGAGATALVFLSFCVIFVVVRSCRKKSARPAAGVGDTGIE
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE
YSEIKIHR

Signal peptide:

amino acids 1-15

Transmembrane domain:

amino acids 351-370

FIGURE 87

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT
GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA
CAACTTACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA
AAAAAA

FIGURE 88



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPVVYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites.

amino acids 74-78, 289-293 

N-myristoylation sites.

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 

FIGURE 89

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACTTATATACAGTTTTAGGGGACAATTAAAAAAAAAAAA

FIGURE 90

MGRVSGLVPSRFLTLLAHLVVVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

FIGURE 91

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAG ATG CAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCGCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA

FIGURE 92

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP
AKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG

Important features of the protein:
Signal peptide:

amino acids 1-14

N-glycosylation sites.

amino acids 62-65, 127-130, 137-140, 143-146

2-oxo acid dehydrogenases acyl transferase

amino acids 61-71

FIGURE 93

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA

FIGURE 94



MTAAVFFGCAFIAFGPALALYVFTIAIEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDNKDG
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19

Transmembrane domains:

amino acids 32-51, 119-138, 152-169, 216-235

Glycosaminoglycan attachment site.

amino acids 120-123

Sodium:neurotransmitter symporter family protein

amino acids 31-65

FIGURE 95

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGATTATATTATGGAATAGATTGAGACACATTGGATAGTCTTAGAAGAAATTAATTCTTAATT
TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT
T AAAACATAT T TGGAAAAC TGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIGURE 96

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 



Signal peptide: 

amino acids 1-16

FIGURE 97



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 

CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 

GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 

AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 

CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 

GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 

GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 

AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 

AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 

GGAT GAGCCAGGCACAGCCAACT CGGAT CT T CGAT T CCACAT CCT GAGCCAGGC TCCAGCCCAGC 

CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 

AC CAGC C T T GACCAC GC C C T G GAGAGGACC T ACC AGC T G T T G G T ACAGG T C AAG GACAT GG G T G A 

CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 

CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 

CACTGGAGTGGGGGTGATGTGCACTATCACGTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 

T GCAGAGGGAAACC TCTACGT GACCAGAGAGC T GGACAGAGAAGCCCAGGCT GAGTACC TGCTCC 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 

ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 

CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 

ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 

CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 

T CGAAGT CGCAGTCACAGATAT CAAT GAT CACGCCCCTGAGT T CATCACT T CCCAGAT TGGGCC T 

ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 

CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 

GCCT GGAT T GGGAGCCAGACTCT GGGCAT GT TAGAC T CAGACT CTGCAAGAACC TCAGT TAT GAG 

GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 

AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 

CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 

ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 

TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 

GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 

GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 

GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 

CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 

CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 

ATTT TCACCCAC TGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 

GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 

TCCCCT GGGAGAGAGCCCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGCCCC T CCA 

TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 

TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 

CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



FIGURE 98

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWVSLEPIHLAENLKVLYPHHMAQ
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHVVYQLLSPEPEDGVEGRA
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVVVVVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPVVVSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI
LIFTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
FIGURE 99



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAGCCTTTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT CTAGGACCCAGGCATCTTGCTT TCCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAATG 
TT C T CCT TAT G T T TGGT C TACT AT TGCAT T TAGAAGCTGCAACAAAT T CCAATGAGAC TAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAG T CACCAAC TCTGAGTT CCATACAACC TCCAGT GGGAT CAGCACAGCCACCAACTCTGAG 
T TCAGCACAGCGTCCAGT GGGAT CAGCAT AGCCACCAAC TCTGAGT CCAGCACAACC T CCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCC TCCAGT GGGGCCAGCACAGTCACCAACT
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC
AGTAGGGCCAGCAC T GC CACCAAC T CT GAGT C TAGCACAC TCT CCAGT GGGGCCAGCACAGCCAC
CAAC TCT GACT CCAGCACAACCTC CAGT GGGGC TAGCACAGC CACCAACT CTGAGTCCAGCACAA
CCT CCAGT GGGGCCAGCACAGCCACCAACT CTGAGT CCAGCACAGT GT CCAGTAGGGCCAGCAC T
GCCACCAACTC TGAGTCCAGCACAACCT CCAGTGGGGCCAGCACAGCCAC CAACT CTGAGT C CAG
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACC TCCAGT GGGGCCGGCACAGCCACCAACT
CTGAGT CCAGCACAGTGTCCAGT GGGAT CAGCACAGTCACCAAT TCTGAGT CCAGCACACCCTCC
AGT GGGGC CAACACAGC CACCAACT C T GAGT CCAGTACGACCT CCAGT GGGGCCAACACAGCCAC
CAACTCT GAGTCCAGCACAGTGTCCAGT GGGGC CAG CAC TGC CACCAAC T C TGAGTCCAGCACAA
CCT CCAGT GGGGTCAGCACAGCCAC CAACT CTGAGT CCAGCACAACC TCCAGTGGGGCTAGCACA
GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG
CAC AG T G T C CAG T GGGAT CAG CACAG T C ACC AAT TCT GAG T CCAG CAC AAC C T C CAG T GGG GCC A
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC

TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCC TGA GCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCT TTCT TGCC TT TACCAGACACTGGAAAGAGAATACTATAT 
T GC T CAT T TAGCTAAGAAATAAATACAT CT CAT C TAACACACACGACAAAGAGAAGC T GTGC TT G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
FIGURE 100

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSVVA
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 



App_ID=10063617






FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 

CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 

GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 

GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 

AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 

GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 

TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAAGCCATCTTGGTATCGCTGGATCAACAAC 

CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

GAT T AT AAC T GGGGAT GCAT TTGTTCCTG GAGAAAGAAG T GT CAT TAT CAT GAAC CAT C GGAC AA 

GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 

TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 

C T T CAT T C AT AGGAAAT GGAAG GAT G AC AAGAGC C AT T T C GAAG AC AT GAT T GAT TAC T T T T GT G 

AT AT T C AC GAAC C AC T T CAAC T C C T CAT AT T CC C AGAAG GGAC T GAT C T C AC AGAAAAC AGCAAG 

TCTCGAAGTAATGCATTTGCTGAAAAAAATGGACTTCAGAAATATGAATATGTTTTACATCCAAG

AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 

ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 

CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 

CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 

GGGAGAAGAAT T T TTATT T TACCGGACAGAGT GTCAT TCCACCT TGCAAGT CTGAAC TCAGGGTC 

CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 

ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 

GAATATTTGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACAT 

T T AAAT T CAAAGAAAAATGAG TAAGAT TATAAGG TT TGCCATG TGAAAACC TAGAGCATAT T T TG

GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 

GCCAT CAT TAT T TGT TAAAGATATT T T GCACT TAAT T T TGTGGGAAAAATAT TGC TACAAT T TT T 

TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 

GGGC CAGAAT AT TAT TAAACAATCAT CAGGC TT T TAAZl 
FIGURE 102



MHSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVIITGDAFVPGERSVIIMNHRTRMDWM
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFVVDRLREGKNLDAVHDITVA
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF
YFTGQSVIPPCKSELRVLVVKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG
LEIIELACYRLLHKQPHLNSKKNE

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
FIGURE 103

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAAT CATCCATCCACCCCTGCTGTCAT CTGT TT TCATAGTGTGAGAT CAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CAC C T C TACAGAGATGGGGAAGACT GGGAAT C T AAGCAGAT GCCACAGT AT CGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCGCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGAT T T GT CTTCAGAC T CCAGAGCAAAT GCAGATGGGTACAGCCTGTATGAT GTGGAGATCT CC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAAT CCAAAGGGAAAAT CCAGGCGGAACTGGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
T T T C TGAT CT GAAAACT G TAACCCAT AGAAAAGCT CCCCAGGAGGT GCCT CAC T CT GAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGT GGGACAAAATGTAGGGTGGTAT GTGGGAGTGTGTCGGGAT GACGTAGACAGGGGGAAGAACA 
AT G T GAC T T T G T C T CCCAACAAT GGG T AT T G GG TCC T CAGAC T GAC AACAGAACAT T T G TAT T T C 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
T GACATGT CAGT T TGAAGGC TT GT T GAGACCC TATATCCAGCAT GCGAT GTATGACGAGGAAAAG 
GGGAC T CCCATATT CATATGTCCAGT GT CCTGGGGATGAGACAGAGAAGACCC T GCT TAAAGGGC 
CCCACACCACAGAC CCAGACACAGCCAAGGGAGAGTGCT CCCGACAGG TGGCCCCAGC T TC C TCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
CT CCATCCAGC TAAGCGAT C T T GAACAAGT CACAACC TCCCAGGCT CC TCAT T TGC TAGTCACGG 
ACAGT GAT T CC TGCC TCACAGG T GAAGATT AAAGAGACAACGAATGTGAATCATGCTT GCAGGT T 
TGAGGGCACAGTGT TTGCTAAT GATGTGTT TTTATAT TATACAT TTTCCCACCATAAACTCTGT T 
T GCT TAT T CCACAT TAAT T TAC T T T T CT C TATACCAAATCACCCAT GGAAT AGT TAT TGAACACC 
T GCT TTGTGAGGCTCAAAGAATAAAGAGGAGGTAGGATT TT TCACTGATTCTATAAGCCCAGCAT 
TACCT GAT ACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAAC T ACAGGTCCATATCC 
C T CAT T AAC ACAGAC ACAAAAAT T C TAAATAAAAT T T T AAC AAAT T AAAC T AAACAATATAT T TA 
AAGATGAT ATATAAC TACTCAGT GT GGT T T GT C CCACAAATGCAGAGT TGGT T TAAT AT T T AAAT 
ATCAACCAGTGTAAT TCAGCACATT AAT AAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
FIGURE 104

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAPCWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSVVASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
FIGURE 105

CC TT CACAGGAC TCT T CAT T GCTGGT TGGC AATGAT GT AT CGGCCAGAT GTGGTGAGGGCTAGG AAAAGAG
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT T CAT TATGTG AG AT ATAATC AAAAGAAG AC CTACAATT ACT AT AGC AC AT T GTC AT TT AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAATGGTG AAAAAT GC AT TTT AT AAATCT C CAT T AAGGGAAGAATT T GTC AAGTCT CAGGTT AT C AAGTT C 
AGTC AACAGAAGCAT GGAGT GTTGGC TC AT ATGCTGTT GATT T GT AGAT T TCACTCT ACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
AT CCT C AC TCAGTT AAAATT AAAAAAAT CAACAAGACAGAAAC AGAC AGCT AT C T AAACC AT TGCTGC GGA 
AC ACGAAGAAGT AAAACT CT AGGT CAGAGT CT CAGGAT CGTT GGT G GGAC AGAAGT AGAAGAGGGTGAAT G 
GC CCTGGC AGGC TAGCCT GC AGT G GG ATGGGAGTCAT C GCTGTGGAGC AACCTT AAT T AATGCC ACAT GG C 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACCT TCGAAAATG AAACGGGGT CT C CGGAGAAT AATT GT CC AT GAAAAATACAAACACCCAT C AC A 
TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
TC CCT G ATGC AT CCT ATG AGTTTC AACC AGGTGAT GTGATGT TT GT GAC AGGAT TT GGAGC ACT GAAAAAT 
GATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACTTGCAATGAACC 
T C AAGC TT AC AATGACGCC ATAACTCCT AG AATGT T AT GTGCTGGCTCCT T AGAAGGAAAAAC AGATGC AT 
GC CAGGGT GACT CT GGAGGACC AC TG GT T AGT T CAGAT GCTAGAGATATCTGGT ACCTTGCT GGAAT AGT G 
AGCT GGGGAG AT GAAT GT GC GAAACC CAAC AAGCCT GGT GTT TAT ACT AG AGT T ACGGCC TT GCGGGACT G 
GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGTGT G GAGGCC AT TTT T AG AGAT AC AG AATTGG AG AAGAC T TGCAAAAC AGCT AGATTT GACT GATC TC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACT CT GT CAT C TGT GAG CAAT AGT T GAAACT TT ATGT AC AT AGAG AAAT AGAT AAT AC AAT AT TAC 
AT TACAGC C T GT AT T C AT TT GT TCTCTAGAAGTTTT GT CAGAAT TTTGACTTGTTG AC AT AAATTT GT AAT 
GC AT AT AT AC AATT T GAAGC ACT C CT TT TCT TCAGT TC CTC AGCT CCT CT CAT T TC AGC AAAT ATCCATT T 
T C AAGGTGCAGAAC AAGGAGTGAAAG AAAATATAAG AAGAAAAAAATCCC CT AC AT TTT ATTGGCACAGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GC AGAAT ACC AAT CAC TT CATC AT T TAGGAAGTATGGGAACT AAGTT AAGGAAGTCCAGAAAGAAGCCAAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AATT AT AC AAACT T CAT GC AAT GTACTTGT TCT AAGC AAATTAAAGC AAAT ATT TAT TTAACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
FIGURE 106

MYRPDVVRARKRVCWEPWIGLVIFISLIVLAVCIGLTVHYVRYNQPCKTYNYYSTLSFTTDKLY
AEFGREASIMFTEMSQRLESMVKNAFYKSPLREEFVKSQ 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
FIGURE 107

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGT T C T CAC T T TAAC TGCAGAAACGAGGGTAGAGGAAGCAGT CAT T TT GACT TACT T TCC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT
GT GTAGAACTGGC T TGT GGCGT TT GGACAT ATGAACAGGAACT TAT GGT T CCAGTACAAT GGTCA 
GAT AT GGT CAC T T T GAAAGC CAGGAT GACAAAT TAT GGAT T AC C T AGAT AT CGGTGGCT TACT CA
TGC T T GGAAT TTTT T TCAGAGAGAGTT TAAGT GCTGTGGAGTAGT AT AT T T CACTGACT GGTT GG
AAAT GACAGAGATGGAC TGGCCCCCAGAT TCC TGC TGTGT TAGAGAATT CCCAGGAT GT TCCAAA
CAGGCCCACCAGGAAGAT CT CAGTGACC T TTATCAAGAGGGT T GT GGGAAGAAAATGTAT T CC T T
TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG
ACAGACCAAAT GATGTCC T TGAAGAATGACAAC TC T CAGCACCT GT CATG TCCCTCAGTAGAACT
GT T GAAACCAAGCC T GT CAAGAATC T T T GAACACACATC CAT GGCAAACAGCT T TAATACACAC T
T TGAGAT GGAGGAGT TATAAAAAGAAAT GTCACAGAAGAAAACCACAAACT TG T T T TATT GGAC T
TGT GAATT TT TGAGTACATACTAT GT GTT TCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA
TAACACCT AAGCATATAC TAT T CTAT GCT T TAAAATGAGGATGGAAAAGT T TCATGT CATAAGT C
ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC
TGT GTATGAC T T T TACT GAACACAG TT AT GT T T TGAGGCAGCATGGT TT GATTAGCAT T TC CGCA
TCCATGCAAACGAGT CACATATGGT GGGAC TGGAGCCATAGTAAAGGT T GATT TACT TCTAC CAA
CTAGTATATAAAGTACTAATTAAAT GCTAACATAGGAAGTTAGAAAATACTAATAACT TTTATTA
CTCAGCGAT CTAT T C TT C T GAT GCTAAATAAAT TAT AT AT CAGAAAACTT TC AATAT T GGTGACT
ACC TAAATGTGAT T T TT GC TGGTT AC TAAAATAT TC T TACCACT TAAAAGAGCAAGCTAACACAT
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT
T TCAGT T C T GATAATGT TAAGAATAACCAT TATGAAAAGGAAAATT T GT CCTGTATAGCAT CAT T

ATT T T TAGCC T T TCCTGT TAATAAAGCT T TACTAT TC TGTCCT GGGC T TAT AT TACACATATAAC 
TGT TAT T TAAAT AC TTAACCACTAAT T T T GAAAAT TACCAGTGT GAT ACATAGGAAT CAT TAT T C 
AGAATGTAGT C TGGTC T T TAGGAAGTAT TAATAAGAAAAT T TGCACATAAC TTAGT TGAT T CAGA 
AAGGACT T GT ATGC TGT T T TT CTCCCAAATGAAGAC TCT T T TT GACAC TAAACAC TTTT TAAAAA 
GCT TAT CTTTGCCTTCTC CAAACAAGAAG C AATAG T C T CC AAG T C AATAT AAAT T C T ACAGAAAA 
TAGT GTTCTTT TTCTCCAGAAAAAT GCTTGTGAGAATCAT TAAAACATGTGACAATTTAGAGAT T 
CTTTGTTT TAT T T CACTGATTAATATAC T GTGGCAAAT TACACAGAT TAT TAAAT T T T T T TACAA 
GAGTATAGTATAT T TAT T T GAAATGGGAAAAGTGCAT T T TAC T GTAT T TT GTGTAT T T TGT T TAT 
T TCTCAGAATATGGAAAGAAAATTAAAATGTGTCAATAAATATT TT CTAGAGAGTAA 
FIGURE 108

MAREDSVKCLRCLLYALNLLFWLMSI

VMIAVCCFLIIVGMLGYCGTVKRNLLLLAWYFGSLLVIFCVELACGVWTYEQELMVPVQWSDMVT
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
FIGURE 109

CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAAC ACT GACC ACCAGTTGGATGAGTCT CAAGGTCCTCCT CTAT GTGAC AACCAT GT GAAT GGGGAG 
TGGT ACCACTTCACGGGC AT GGC GGGAGAT GCCAT GCCT ACCTTCT GCAT ACCAGAAAACCACT GTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG
CAGCTGCTCAGATACCAGC GAGT GCACATGC GCTCCAGGAACT GT GCT AGGC CCTGACAGGCAGACAT GCTTT G
ATGAAAATGAATGT GAGCAAAACAACGGTGGCT GCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCT GT
GAGT GT G GGGT T GGC C GT GT GCT AAGAAGT GAT GG C AAGAC T T GT GAAGAC G TT GAAGGAT GCC AC AAT AAC AA
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT
CT GAGGAT AACCACACTT GCCAAGTCCCT GT GTT GT GCAAATCAAATGCCAT TGAAGTGAACATCCCC AGGGAG
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG
ACC T GC GAGT T TCC AC GC C T GT AC ACC AT T T CT GAAGGAT AC GTTCC C AACC T T C GAAAC T CCC C ACT GGAAAT
CAT GAGC C GAAAT CAT GGG AT CT TC CC AT T C ACTCT GGAGATC T T C AAGGAC AAT GAGT T T GAAGAGC CTT ACC
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC
TTGGAAAGCTT GGTGGAGAGCT GCTTT GC C ACCCC C ACCT C CAAGAT C GAC GAGGT CCT GAAAT ACTACCTC AT
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGT GGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGAC TAG TTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
C CC AGAAAGAC ACT C ACCC C AT TT CCC T CAT T T CT TT CC T AC AC T T AAAT AC CT C GT GT AT GGT GC AATC AGAC 
C AC AAAAT CAGAAGCT GGGT AT AAT AT T T C AAGTT AC AAACC C T AGAAAAAT T AAAC AGT TACT GAAAT TAT GA 
CTT AAAT ACCC AAT GACTC CT T AAAT AT GT AAATT AT AGT TAT ACCT T GAAATT T C AAT T CAAAT GC AGACT AA 
TTATAGGGAAT TT GGAAGT GT AT C AAT AAAAC AGT AT AT AATT TT 






FIGURE 110



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTVVDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 
amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites, 
amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site.

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
FIGURE 111

GAGAGAGG C AGC AGCT TGCT CAGCGG ACAAGGATGCT G GGCGTGAGGGACC AAGGCCT GCCCTGC ACT CGG

GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 

CCTGCT GCCTT GGGGT GAC AAT CT C AGCT C CAGGCT AC AGGGAGACCGGGAGGATCAC AG AGCC AGCATGT 

TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 

AT GGAG ACCTTC AGAAAGGT GGGG AT CCCC AT CATC AT AGCAC T ACTGAGCCT GGCGAGT AT CATCAT TGT 

GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 

GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 

CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 

AG GGAACT GGT T CT CT GCCT GTTT CG AC AACT TCAC AG AAGCT C TCGCT G AGAC AGCCT GT AGGCAGATGG 

GC TACAGC AGAGCT GTGG AGATTGGCCC AGAC CAGGAT CTGGAT GTTGT T GAAATC ACAGAAAACAGCCAG 

GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 

TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 

TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 

GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 

CTTCCC AT CCCT GGCTGT GGCCAAGAT CAT CAT C ATTG AATT CAACCC C ATGT ACC CCAAAG AC AAT GAC A 

TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 

GATGAGGAGC TC AC TCCAGC CACCCC ACTCTGGAT C AT TGGATG GGGCTT TACGAAGC AGAATGGAGGGAA 

GATGT C T G ACAT ACT GCT GC AGGC GT CAGT CC AGGT CAT TGACAGCAC ACGGTGCAAT GC AGACGAT GCGT 

ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 

GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 

CTGCGGGGGC CCGAGC AC CCCAGGAGT ATACACC AAGGT C TCAGCCT ATCTCAACT GGAT CT ACAAT GTC T 

GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 

GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 

T T CTTGGAGC AGC AAAGGGCCT CAAT TCCT GT AAGAGACCCT CGCAGCCC AGAGGCGCCC AGAGGAAGTC A 

GC AG CCCT AGCT CGGCCACACT TGGT GCTC CCAGCATCCCAGGGAG AGAC AC AGCCC ACT GAACAAGGTCT 

C AGGGGTAT T GCTAAGCCAAGAAG GAACT T TCCC AC ACT ACT GAAT GGAAGCAG GCTGT C TT GT AAAAGC C 

CAGATC ACTGTGGGCT GG AGAGGAGAAGGAAAGGGT CT GC GCCAGC CCTGTCCGTCTTCACC CATCCC CAA 

GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 

ACTGTT GTCATT GTT AT T AC AGCT AT GG CC ACT ATT AT T AAAGAGCT G TGT AAC AT CT CT GGCAAAAAAAA 

AAAA 
FIGURE 112

MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVVVLIKVILDKYYFLCG
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL
KTPRVVGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS
DQWHVVGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL

Transmembrane domain:

amino acids 32-53 (type II)






FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGAT TGA CAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAAT TAT GGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGC T GCTGAAGGGCAAC TGCAGGCCGAT GC TCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C TCAAT T TAAAT CAT GT TC TAGT AAT T GGAGC TGTCCCCAAGACCAAAGGAGC TAGAGC T TGGT T 
CAAAT GATCT CCAAGGGCCCTTATACCCCAGGAGAC T T T GAT T T GAAT TT GAAACCC CAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAAT CACTT CAGCC T GAGCAACACAGCGAGACTC TGT C TCAGAAAAAAT AAAAAAAGAAT T A 
TGGT TAT TTGTAA 



FIGURE 114

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
FIGURE 115

CAGCAGT GGT C TC T CAGTCCTCT CAAAGCAAGGAAAGAGTACT GT GTGCTGAGAGACCATGGCAA 
AGAATCCTCCAGAGAAT TGTGAAGACT GTCACATTCTAAATGCAGAAGCTT TTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTT T TGGGGGAGCAAGCACT TC TGGCCGGAGGTACCCAAAAAAGCC TATGACAT GGAGCACACT T 
T CTACAGCAATGGAGAGAAGAAGAAGAT T TACATGGAAAT TGATCCTGT GACCAGAAC TGAAATA 
TTCAGAAGCGGAAATGGCACTGAT GAAACATTGGAAGTGCACGACT TTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACT TTCT TTGAACAGTCAGTG 
ATT TGGGT CCCAGCAGAAAAGCCTAT TGAAAACCGAGATT TT CT TAAAAAT T CCAAAAT TC TGGA 
GAT T T GT GAT AACG TGACCAT GTAT T GGATCAATCCCAC TCTAATAT CAGTT TC TGAGTTACAAG 
ACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGAAAAAAAAGGGATTGAACAAAAT 
GAACAGTGGGTGGTCCC T CAAGTGAAAGTAGAGAAGACCCGTCACGC CAGACAAGCAAGT GAGGA 
AGAACT TCCAATAAATGACTATACTGAAAATGGAATAGAATTTGATCCCATGC TGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTAT CACCAAAAAAAAAAAAAAAAAAA 
FIGURE 116



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE
LQDFEEEGEDLHFPANEKKGIEQNEQWVVPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 


Important features of the protein:
Signal peptide:

amino acids 1-40

Transmembrane domain:

amino acids 25-47 (type II)

N-glycosylation sites.

amino acids 94-97, 180-183

Glycosaminoglycan attachment sites.

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239-242

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 



Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
FIGURE 117

GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TC TGTGTT TGCC AACATGCT GGT GACT AACTT CTGGAT GTCCAC AGCT AACAT GT ACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGT CC AAC ACCAAAAAC AAGAAGAT AT ACGATGG AGGT GCCC GC ACAGAGGACGAGGT AC AATCT T AT C 
CT TCCAAGCACG ACT ATGTGTAATGC TCT AAGACCT CT CAGC ACGGGCGGAAGAAACTCC CGGAGAGCT C A 
CC CAAAAAAC AAGG AG ATCCCAT C TAGATT TC TT CT TGCT TTTGACTC ACAGCT GGAAGT TAGAAAAGCC T 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT
TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC
TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA
ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC
ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA

FIGURE 118



MSTTTCQVVAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL
CAIAGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYTFGAALFVGWVAGGLTLIGGVMMCIA
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY
V

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194

FIGURE 119

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG
TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAAGCTATTTCAGCAGAATGAGATA
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT
TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA
TTTATAATGAAGATTAAAATGAAGGCTTTAATCAGCATTGTAAAGGAAATTGAATGGCTTTCTGATAT
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATTTATATTAATAAATTGTACATTTTTCTAATT

FIGURE 120



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIVVFENFWEGLWMNCVRQANIRMQCK
IYDSLLALSPDLQAARGLMCAASVMSFLAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG
MVVLIPVSWVANAIIRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV

Signal peptide: 

amino acids 1-17 



Transmembrane domains:

amino acids 82-101, 118-145, 164-188

FIGURE 121

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC
ATTGAAGCTATAATTTATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAATTAATATTCATCG
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC
ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC
TTGGAATGGTTCACTTAAATGACATTTTAAATAAGTTTATGTATACATCTGAATGAAAAGCAAAG
CTAAATATGTTTACAGACCAAAGTGTGATTTCACACTGTTTTTAAATCTAGCATTATTCATTTTG
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT
TAAATAAAAATTATTTCCAACA

FIGURE 122



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREVVDLYNGMCLQGPAGVPGR
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217

FIGURE 123



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTGCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC
CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG
ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAATTTTATGTA
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 



FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512

FIGURE 125



CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCATTTACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG 
AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA
ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA
TGAGTGGAGTCAGTCCATTTTAA

FIGURE 126



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREVVFGKSEDEHYPLWKS
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKILAEGGIRGLWAGWVP
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248

FIGURE 127

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 

CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 

AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGGGGCCGTGGCC 

CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 

GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 

CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 

TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 

GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 

AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA

CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 

GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA

ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 

CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC

TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 

CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA

TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 

TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 

TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 

TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 

AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 

AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 

CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGGAACCAA

AAAAAAAAAA 



FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKVVGCARTVGNIEELAAECKSAGYPGTLI
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC
ISPGVVETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115,
199-205

Short-chain alcohol dehydrogenase.

amino acids 30-42, 104-114

FIGURE 129

AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA
TTAATTTTTAACTGATAGTTGTACATATTTGGGGGTACATGTGATATTTGGATACATGTATACAA
TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATTTATTTTTTATTCTTT
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG
GCAAAAA 



FIGURE 130

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site.

amino acids 86-92

Zinc carboxypeptidases, zinc-binding region 2 signature.

amino acids 68-79

FIGURE 131

TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT
GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG
GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT
GATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCCTGGTATTGCGGGTGCAAGATGAAATG 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA
AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC
ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC
CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA
CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG
ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT

FIGURE 132

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS

Important features of the protein:
Signal peptide:

amino acids 1-28

Transmembrane domain:

amino acids 531-552

N-glycosylation sites.

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636

Tyrosine kinase phosphorylation site.

amino acids 515-522

N-myristoylation sites.

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561,
640-645

Amidation site.

amino acids 567-570

Leucine zipper pattern.

amino acids 159-180

Phospholipase A2 aspartic acid active site.

amino acids 34-44

FIGURE 133

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 

GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 

TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 

GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 

CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT

GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 

GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 

TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 

CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 

GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 

CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 

GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 

AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 

GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 

AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 

TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 

AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA

TATCAGCAGCATTTTTCGAGGAGAGTGAAGAGGCGAGAAAAACAATTTCCAGATTCTCGCTCTGT

TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 

AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 

TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 

CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 

GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 

ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 

GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 

CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 

TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG

ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 

GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA

FIGURE 134

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites.

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237

FIGURE 135

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCGTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGTTGCCTGAT
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAATATTGATGTCACTGAATTAATTACAGTGTCCTATAGAAAATGCCATTAATAAATTAT
ATGAACTACTATACATTATGTATATTAATTAAAACATCTTAATCCAGAAATCAAAAAAAAAAAAA
AAAAAAAAAAAAAAA 



FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAVVPGVKPQDWISAA
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEVVSPAYRFDPVRVDITSKGKMRARYVNYIKTSE
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVLPLLIFVLLRKVVNTSDPDMRREME
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites.

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop).

amino acids 229-236

FIGURE 137

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG
TGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACTGAGTCTTTGCAAGACACA
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC
TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATA&GAGGAAAGTGATGCTG
CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGGATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA
ATGTATTTATTTTTTTACTTGGACATGAAACTTTAAAAAAATTCACAGATTATATTTATAACCTG
ACTAGAGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGG
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA
AAAAAAAAAAA

FIGURE 138

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141

FIGURE 139



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCTTTTATGGAGATGAAACCTGGGTTAAATTA
TTCCCAAAGCATTTTGTGGAATATGATGGAACAACCTCATTTTTCGTGTCAGATTACACAGAGGT
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG
AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA
CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC
ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC
CAATGAGAGAGCAGTTGAGATTTTTACATTTGAATACAGTGCAGCTTAGTAAACTGTTGCAAGAG
AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG
GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATMACGGCAATAATTGAGAAAAAAAA



App_ID=10063617 






FIGURE 140

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKVVIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 






Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276,
275-281, 278-284



Amidation site. 

amino acids 154-158 



Cell attachment sequence. 

amino acids 205-208

FIGURE 141

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCATTTCAACCAGTTTGCAA
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT

FIGURE 142

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation sites.

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182

FIGURE 143



CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT
GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC
ACACATCTTGAGCCTAATCATGTAGTGTAGATCATTAAACATCAGCATTTTAAGAAAAAAAAAAA

L A?AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



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FIGURE 144

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP
WMTQFSLLNKTCLEGFH 



Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites.

amino acids 117-121, 139-143

N-myristoylation site.

amino acids 9-15
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FIGURE 145 



CTGT GCAGC T CGAGGC TCCAGAGGCACACT CCAGAGAGAGC CAAGGT T CT GACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT
CTCTAATGAAATTGTGAAAGCTCCATGTTTAGAAATAAATGAAAACACCTGA
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 




Transmembrane domain: 



amino acids 157-171 



N-glycosylation sites. 



amino acids 98-102, 110-114 




Tyrosine kinase phosphorylation site. 



amino acids 76-83 



N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 



Amidation site. 



amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T TTCAGAT TCAGGGGGTACAT GTGAAGGT T TGTT T TAT GAGT AT AT T GCATGAT GC T GAGG 
TTTGGGGT 



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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 



N-myristoylation site.

amino acids 62-68
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FIGURE 149

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGCTCCCCACT TGATAATAAGT TGAATGTGGAAGAT GTTGATT CAAC CAAGAAT CGAAAACTG 
ATCGATGATTATGACTCTACTAAGAGT GGAT TGGAT CATAAATT T CAAGATGATCCAGAT GGT CT TCA 
T CAACTAGACGGGACT CCT T TAACCGCTGAAGACAT TGT CCATAAAATCGCT GCCAGGAT TTATGAAG 
AAAAT GACAGAGCCGTGTTT GACAAGATT GT T TCTAAACTACTTAATCT CGGCCTTATCACAGAAAGC 
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 
TTAT GAGGAGGATCC CAATAAGCCCACAAGCT GGACTGAGAAT CAGGCT GGAAAAATACCAGAGAAAG 
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA
TTAACCT T GACAAAT GGCT T GGAAAGGAGAACTAAAACCTACAGT GAAGACAACT T TGAGGAACT CCA 
ATAT TTCCCAAAT TT CTATGCGCTACTGAAAAGTAT TGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT
C CAGAAGAAGGT GT T T C CT AC CT T GAAAAC T T GGAT GAAAT GAT T GC T C T T CAGAC C AAAAAC AAGC T 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGT ACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCT TGAAGGATT CCACAAAA 
GAT GATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTAT TTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT
CAAAGAT GAGAGACT TCAT CAATAAACAAGCT GATGCTT ATGT GGAGAAAGGCAT CCT TGACAAGGAA 
GAAGC CGAGGCCATCAAGCGCAT T TATAGCAGCCTGTAAAAATGGCAAAAGAT CCAGGAGTCTTT CAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE 150

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217-220,
265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368-371,
369-372, 382-385, 386-389, 387-390

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
G ATG GTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
AT AATAACCAGCT TC TAGC TGGAGGGCT GCATGCAGGGAAGGT CAT TAAAGG TGAAGAGAT CAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGC T C TATC TT GGTGC CAAGGAAT CCAAGAGC T T CACC TTC TAC CGG CGGGACAT GGGGCT CACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA
GAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCTCTTTTAAAAACCCAA
GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA
AGACCTACTTACAAAGTGGCATATATTGCAATTTATTTTAATTAAAAGATACCTATTTATATATT
TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA
ATTCCTTGTATAAAAATAAGAAAAGAAATTAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG
TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT

GAC T GGT TT CC T TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATGTA 
AAGAT GAAGGCAGAGATCGGAGTT TTGCAGCCACAAGCTAAGAAACACCAAGGAT TGTGGCAACC 
AT CAGAAGC TTGGAAGAGGCAAAGAAGAAT TC T T CCCTAGAGGC T T TAGAGGGATAACGGC TC TG 
CT GAAACCT TAAT C T CAGAC TT CCAGCCTC CTGAACGAAGAAAGAATAAAT TTCGGCT G T T T TAA 
GCCACCAAGGATAAT T GGT TACAGCAGCTCTAGGAAAC TAATACAGC T GCTAAAATGAT CCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATAT GGCAGAAGTGATGGCATGCCACT TCCAAGATTAGGT TAT AAAAGACAC TGCAGCT T C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT
GCCCACAGCCACAT TAGT GAACCTAGAAGCAGAGACTC TGT GAGATAAT CGATGT TTGTTGTTTT 
AAGTTGCTCAGT TTTGGTC TAACT TGT TATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISVVPNRWLDASLSPVILGVQGGS
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites.

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins.

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA
AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGGTGTTC
TATACACAGAAAACAATTTATTTTTTAAATAATTGTCTTTTTCCATAAAAAAGATTACTTTCCAT
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA
TGTATTTATTATTATTATAAGACTGCATTTTATTTATATCATTTTATTAATATGGATTTATTTAT
AGAAACATCATTCGATATTGCTACTTGAGTGTAAGGCTAATATTGATATTTATGACAATAATTAT
AGAGCTATAACATGTTTATTTGACCTCAATAAACACTTGGATATCCC
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FIGURE 154 

MAALQKSVSSFLMGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASL
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEVVPFLARLSNRLS
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAGC CGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGC T TT GGGGAAAACCT GCAC T TCT GCACAT T T T GAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT
T TGACAT GGATGAT T C TGAGGAGGAAGCT GT TAT TGAATGT ATAGAGAT TTATCCAAATAAATAT 
C T T TAT T T AAAAATGAAAAA 



FIGURE 156 

MRERPRLGEDSSLISLFLQVVAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 






CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 

GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 

CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA 

ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 

GATT TGTGTGAC GGGCAAAAGCAAC T TCCAGT CCTACAGCT GT GTGAGGT GCAATTACACAGAGG 

CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 

GAGCTGAACACAGTCTAT TTCATTGGGGCCCATAATATT CCTAATGCAAATATGAATGAAGATGG 

CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 

AGT GTGT CAAGGCCGGAAGCC T GTGGGATCCGAACATCACT GC T TGTAAGAAGAATGAGGAGACA 

GTAGAAGT GAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCT CT TATCCAACACAGCAC 

TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 

TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 

AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 

GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 

CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 

TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 

AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGT GCAGT GGCT T GCCAC TCAAAAGAAGGCA 

GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 

GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 

GTGAT CT AAGAAGCCAGAT TCATCTGCACAAATACGT GGTGGTCTAC T T TAGAGAGAT TGATACA 

AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 

CTGTGCAGAACT TC TC CAT GTCAAGCAGCAGGTGT CAGCAGGAAAAAGATCACAAGCCT GCCACG 

ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158






MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH
STIIGFSQVFEPHQKKQTRASVVIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTVVLCPQTGVPFPLDNNK
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA
FNLFCSDLRSQIHLHKYVVVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD
GCCSL

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67-71, 103-107, 156-160, 183-187, 197-201 and 283-287

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 228-232 and 319-323

Casein kinase II phosphorylation sites.

amino acids 178-182, 402-406, 414-418 and 453-457

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-492
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FIGURE 159



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 

ACTCAGCTGAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEVVQAQCRNLGCINAQGKEDISMNSVPIQQETLVV
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ

Signal sequence: 

amino acids 1-30 

N-glycosylation site.

amino acids 83-87 

N-myristoylation sites.

amino acids 106-111, 136-141 
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FIGURE 161 

ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCGTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTGTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGGAGCACTTCGGCCTCTCCCTGTACTGGAATCAG
GTGCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG

CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAATAAAGGCAGACGCTGT TT TT CTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPVVLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVVVLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 

GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACAT CCT GACGT GGGACAGCGGGCCAGAGGGCACCCCAGACACGGT CT ACAGCAT CGAGTATA 
AGACGTACGGAGAGAGGGACTGGGT GGCAAAGAAGGGC T GTCAGCGGAT CACCCGGAAGT CCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGT GCGGGAGGCCGG T CAGCCACCAAGATGAC T GACAGGT TCAGCT C T CTGCAGCACACT AC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TACCACT TAGAGC TCCAGGTCAACCGCACC TACCAAATGCACC T TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CC TGGGCCAAGGAGAGT GCCCCC TACAT GTGCCGAGT GAAGACACT GCCAGACCGGACAT GGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC

TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
GTCGGGCCCCCATCCTATGCACCTCAGGTGACGCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
CCCTGGGGAT T TGCACAGACAGAACAT CT GACCCAAATGT GCTACACAGT GGGGAGGAAGGGACA 
CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC
TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG
GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG
AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA
ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T TCC CAGC CAGGGCAACTGCC TGACGT T GCACGAT T TCAGCT TCAT TCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCT TTTCTGCAGGCAGGAGTT TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
C TCACCACT CACAAGCT GT GTGAC TTCAAACAAAT GAAATCAGTGCCCAGAACCT CGGT T TCC T C 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 



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FIGURE 164 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 






Signal sequence. 



amino acids 1-17 



Transmembrane domain. 




amino acids 233-250 



N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 



N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAAAAAAAAAAAAAAAA?UU\AAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGTTTTGGATATTTTCCAAAAGATTTGATCAAGGTACTTCATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTT TGT CTGCT TTGAAGGAGGAAGAGATGAT T 
TTAATAGT TATAATGTAGAAGAGCTTTTAGGATCTT TGGAACTGGAGGACTCTGTACCTGAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 

CAGAGAGCACCGAGGGGCTGCAGGGACAGCCC TCAGCT CAGGAGAGC CACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGAT GCT TACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGT GGACAGTGCGTTATT 
CAT TACAGCAAAGGATT TCGT T GGCATCAAAAT CTAAGT T T GT T T TACAAAGAT T GT T T T TAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 

AAAAAAAAAAAAAAAAAAAA 



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FIGURE 166 



MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGG CAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCAT GT TGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA
CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA
ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT

T GT C TAAT T TCAAT TGTGCAAGACAT GT GCCT TATAAT TAT T T T TAGCT TAAAAT TAAACAGAT T 
T T G TAAT AAT G T AAC T T T G T TAAT AG G T GCAT AAACAC TAAT GCAG T CAAT T T GAAC AAAAGAAG 
TGACATACACAATATT^lAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 

ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG

ACTATAT T AGTATACAAAGAGGT CAT GTGGTT GAGACCAGGTGAATAGTCACTATCAGT GT GGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAA?y\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRVVSLLLGAALLCGHGAFCRRVVSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK
PYLTNQPGDTHQNVVVTEAGIIPNLIYVVIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ
STLWISKSTRKESGMEV

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140-145,
212-217
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